NIST Form-Based Handprint Recognition System
نویسندگان
چکیده
The National Institute of Standards and Technology (NIST) has developed a new release of a standard reference form-based handprint recognition system for evaluating optical character recognition. As with the first release, NIST is making the new recognition system freely available to the general public on CD-ROM. This source code testbed, written entirely in C, contains both the original and the new recognition systems. New utilities are provided for conducting generalized form registration, intelligent form removal with character stroke preservation, robust text-line isolation in handprinted paragraphs, adaptive character segmentation based on writing style, and sophisticated MultiLayer Perceptron (MLP) neural network classification. A software implementation of the machine learning algorithm used to train the new MLP is included in the test-bed, enabling recipients to train the neural network for pattern recognition applications other than character classification. A host of data structures and low-level utilities are also provided. These include the application of spatial histograms, affine image transformations, simple image morphology, skew correction, connected components, Karhunen Loève feature extraction, dictionary matching, and many more. The software test-bed has been successfully compiled and tested on a host of UNIX workstations including computers manufactured by Digital Equipment Corporation, Hewlett Packard, IBM, Silicon Graphics Incorporated, and Sun Microsystems.1 Approximately 25 person-years have been invested in this software test-bed, and it can be obtained free of charge on CD-ROM by sending a letter of request via postal mail or FAX to NIST. This report documents the new recognition software test-bed in terms of its installation, organization, and functionality.
منابع مشابه
Component-based handprint segmentation using adaptive writing style model
Building upon the utility of connected components, NIST has designed a new character segmentor based on statistically modeling the style of a person’s handwriting. Simple spatial features (the thickness of the pen stroke and the height of the handwriting) capture the characteristics of a particular writer’s style of handprint, enabling the new method to maintain a traditional character-level se...
متن کاملPublic domain optical character recognition
A public domain document processing system has been developed by the National Institute of Standards and Technology (NIST). The system is a standard reference form-based handprint recognition system for evaluating optical character recognition (OCR), and it is intended to provide a baseline of performance on an open application. The system’s source code, training data, performance assessment to...
متن کاملIntelligent System for Reading Handwriting on Forms
The National Institute of Standards and Technology (NIST) has developed a form-based handprint recognition system for reading information written on forms. This public domain software test-bed may be obtained from NIST free of charge on CD-ROM. The recognition system is modular in design and integrates algorithms from heterogeneous computational paradigms including artificial intelligence, imag...
متن کاملOff-line Handwriting Recognition from Forms
A public domain optical character recognition (OCR) system has been developed by the National Institute of Standards and Technology (NIST) to provide a baseline of performance on off-line handwriting recognition from forms. The system’s source code, training data, and performance assessment tools are all publicly available. The system recognizes the handprint written on Handwriting Sample Forms...
متن کاملGeneralized Form Registration Using Structure-Based Techniques
A new method for registering forms has been developed at the National Institute of Standards and Technology. This method automatically estimates the amount of rotation and translation in the image without any detailed knowledge of the form. This is accomplished through the automatic detection of dominant vertical and horizontal structures (lines) commonly found in forms. A general method for ro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1994